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Abstract. Low- noise phased arrays are essential for the next generation of microwave 
and submillimetre wave astronomy. We analyze their behaviour from a functional 
perspective, and show that their operation is intimately related to the mathematical 
theory of frames. No assumptions are made about the orthogonality or linear 
independence of the synthesised beams. Frame theory allows an unambiguous 
assessment of whether the outputs of an array can be used to observe a field or 
brightness distribution within a given class. Image reconstruction is carried out using 
dual beams. We identify the natural modes of phased arrays, and carry out an analysis 
of noise. The scheme allows the expectation values, the mean-square fluctuations, and 
the correlations between fluctuations at the output ports of a phased array to be 
determined for a source in any state of spatial coherence. Both classical and photon- 
counting statistics are included. Our model is conceptually powerful, and suggests 
many simulation and image recovery techniques. 
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1. Introduction 

Recently, there has been a surge of interest in developing phased arrays for radio 
astronomy. Projects include the Square Kilometer Array (SKA), the Low Frequency 
Array (LOFAR), the Electronic Multibeam Radio Astronomy Concept (EMBRACE), 
the Karoo Array Telescope (KAT) [TJ El [3, H] , and a number of instruments for enhancing 
the performance of single-dish telescopes. Most of the current projects are aimed 
constructing phased arrays for microwave astronomy, but as technological capability 
improves, phased arrays will also be constructed for far infrared and submillimetre wave 
astronomy j5j E] . 

Two types of phased array are of interest: (i) imaging phased arrays, where an array 
of passive antennas, or coherent receivers, is connected to a beam-forming network such 
that synthesised beams can be created and swept across the sky; (ii) interferometric 
phased arrays, where the individual antennas of an aperture synthesis interferometer are 
equipped with phased arrays such that fringes are formed within the synthesised beams. 
In this way it is possible to extend the field of view, to observe completely different 
regions of the sky simultaneously, to steer the field of view electronically, and to observe 
spatial frequencies that are not available because the baselines of an interferometer 
cannot be packed more tightly than the diameters of the individual telescopes. 

It is important to appreciate that the synthesised beams of a phased array need 
not, mathematically speaking, be orthogonal and may even be linearly dependent. Non- 
orthogonality may be built into a system intentionally as a way of increasing the fidelity 
with which an image can be reconstructed, or it may arise inadvertently as a consequence 
of RF coupling and post-processing cross-talk. In some situations, say in the case of 
interacting planar antennas, it may not even be clear how to distinguish one antenna 
from another, even before the beam-forming network has been connected. 

Despite the considerable importance of phased arrays for astronomy, there is still 
a lack of information about the principles of operation when the synthesised beams 
are non-orthogonal, or even linearly dependent, and when noise is included. In this 
paper we describe the operation of low-noise imaging phased arrays from a functional 
perspective, and show that information throughput and image recovery are intimately 
related to the mathematical theory of frames. We also show that it is only necessary to 
know the synthesised beams in order to calculate the average powers, the correlations 
between the complex travelling wave amplitudes, the fluctuations in power, and the 
correlations between the fluctuations in power (the Hanbury Brown-Twiss effect) at the 
output ports of an array. 

The paper provides a powerful conceptual framework for understanding the 
operation of phased arrays. A key feature of the analysis is that we do not fill the half 
space between the aperture plane and the sky with a complete set of plane waves, but 
limit our attention to the natural optical modes of the system. In this way, issues relating 
to information throughput, sensitivity, noise, correlations, and quantum statistics can 
be dealt with in a straightforward manner. Indeed, the quantum statistical properties 
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of the incoming radiation field can be taken into account, for a source in any state of 
spatial coherence, and the transition from fully bunched to photon-counting statistics 
included as the wavelength of operation moves from the microwave range through into 
the far infrared. 

2. Basic Principles 

In general an imaging phased array comprises a sequence of optical components, an array 
of single-mode receivers, each of which has a primary beam pattern, and an electrical 
beam-forming network such that each output port corresponds to a synthesised reception 
pattern on the sky. The synthesised reception patterns may be static and designed to 
give optimum sampling on a given class of object, or they may be controlled electrically 
and swept across the field of view. In the case of radio astronomy, the optical system 
would be a telescope, the single-mode receivers would be horns or planar antennas 
coupled to HEMT amplifiers or SIS mixers, and the beam-forming network would be a 
system of microwave or digital electronics. 

The following analysis is based on a generic system comprising an array of M horns 
and a beam-forming network having P output ports. Each of the P ports is associated 
with a synthesised reception pattern. For convenience, A denotes the input reference 
surface, which is a far-field region of the sky, B the output ports of the horns, and C the 
output ports of the beam-forming network. 

When a fully coherent field is incident on the system, a set of travelling waves 
appears at £>, and we shall denote their complex amplitudes by {y m : m G 1, • • • , M}; 
also, a set of travelling waves appears at C, and we shall denote their complex amplitudes 
by {z p : p G 1, • • • , P}. Because M and P are finite, the complex amplitudes can be 
assembled into column vectors y G C M and z G C p . If E(f2) is the plane-wave spectrum 
of the incident electric field, and ft a unit radial vector in the aperture plane pointing 
towards the sky, then it can be shown that because the synthesised reception patterns 
and incoming field are square-integrable functions, which can be represented by vectors 
in the Hilbert space of square integral functions over A, the complex amplitude of the 
travelling wave at port p can be written 

Zp = [ t;(ti) ■ E(n) dn, (1) 

J A 

where t p (Q) is the complex synthesised reception pattern of port p. It would be 
naive to assume, however, that if the array is illuminated by a field having the form 
E(f2) = tp(O), a travelling wave only appears at p. 

It is instructive to derive a relationship between the synthesised beams and the 
beams of the primary antennas. If the beam patterns of the primary antennas are 
denoted by h m (f2) then the outputs of the antennas, y m , are given by 

Vm = f h*jn).E(£i)dn, (2) 

J A 
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but the beam- forming network can be described by scattering matrix having elements 



b vm , where 



z p — ^ j ^pmymi (3) 
in 

and therefore substituting (j2j) in (j3J), 

^<P Pm h* m (n)-E(n)dn, (4) 



Zp 



A ' 



which can be cast into the form of (0Q) by defining 

t p (o) = ^; ro h m (n). (5) 

m 

As expected, the synthesised reception patterns are weighted linear combinations of the 
primary antenna patterns. 

Even when the primary beams are orthogonal, and identical, the synthesised beams 
do not have to be orthogonal, because 

[ t;(n)-t p ,(n)dn = kJ2<P Pm <p;, m , (6) 

J A 

where (jSJ) has been used, k is a constant that derives from the inner products of the 
primary antenna patterns, 



h* m (n)-h m ,(n)dn = k5 mm> , (7) 

I A 

and there is no restriction on the orthogonality of the pm . can be written in the 
form of a matrix equation: 

J t;(n)-t p ,(Ci)dn = k[*&] ppl . (8) 

If M of the columns of $ f : C p -> C M are linearly independent, and P > M, then 
the complete set of columns constitutes an over-complete basis, in the sense that there 
are P vectors in an M-dimensional space. In this case, except trivially when certain 
ports are not connected, the synthesised reception patterns are not orthogonal, because 
$$1" 7^ D, where D is a diagonal matrix of dimension P; indeed, the synthesised 
reception patterns are linearly dependent. Contrariwise, the columns of & comprise an 
under-complete basis if P < M, and again we may have 7^ D. In the case where $ 
is unitary, = Ip, where Ip is the identity matrix of dimension P, and the primary 
beam patterns are orthogonal, the synthesised reception patterns are orthogonal. We 
conclude that, except in the simplest of scenarios, the synthesised reception patterns 
are unlikely to be orthogonal, 

t;(ft).v(n)dn^fciv, (9) 



A 



and may even be linearly dependent. According to (0Q) and (jHJ), travelling waves will 
generally appear at more than one port even when the incident field has the form of one 
of the synthesised reception patterns: E(fi) = ty(O). 
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(jSD indicates that the synthesised beams are weighted linear combinations of the 
primary antenna patterns; or conversely, that there must be sufficient antennas of the 
correct form, such that all of the required synthesised reception patterns, are spanned, 
mathematically speaking, by the primary antenna patterns. In the case of real systems, 
the primary beams h m (f2) : m G {1, • ■ • , M}, which are the individual beams in the 
presence of all of the antennas, including scattering, are likely to be different to one 
another; for example, the antennas at the edge of an array will have different beam 
patterns to those in the middle. In addition, the primary beams may have far-out 
sidelobes, which can couple to warm radiating objects such as the ground. Ideally, it 
must be possible to span all possible synthesised beams, as the source moves across the 
sky, including the requirement that the synthesised beams must be near zero outside 
of some field of view. If nulling is not achieved, high noise temperatures, which may 
be a strong function of may result. Low noise temperatures can only be achieved 
by ensuring that the reception patterns of the primary antennas can only couple to 
low-noise regions of the sky, or, formally speaking, that there are many more degrees of 
freedom in the system than are needed to simply create the main lobes of the synthesised 
beams. 

In astronomical applications, phased arrays are used to image incoherent or, in the 
case of celestial masers, partially coherent fields. It is convenient, therefore, to introduce 
correlation dyadics. Define the correlation dyadic of the incident field according to 

I(n 1 ,n 2 ) = (E(n 1 )E*(n 2 )), (10) 

where E(f2) is the complex analytic representation of the quasi-monochromatic electric 
field, and ( ) denotes the ensemble average. The final result can be integrated 
with respect to frequency to calculate broadband behaviour, but we do not show 
frequency dependence explicitly. We shall assume throughout that the electric-field 
is normalized to the square root of the impedance of free space so that the elements 
of E(f2i, Q2) have the units of Wm~ 2 Sr~ 2 Hz -1 . The rank 2 tensor E(Oi, 2 ) contains 
complete information about the correlations between all pairs of transverse vector field 
components for any two points on the sky. Once the correlation dyadic is known, all 
classical measures of coherence follow. 

The correlation between the travelling wave amplitudes at any two output ports is 
given by (z p z*,), or in matrix form Z = (zz^) G C PxP . The elements of Z can be found 
by using jTJ and ^TDJ): 

Zpp>= [ / t^O-l^aA)- v(n 2 )dfi lC zo 2 , (11) 

J A J A 

and because the the travelling wave amplitudes are normalized to the square root of 
impedance, the elements of Z have the units WHz -1 . According to (iTTj) the correlations 
at the outputs are merely the matrix elements of the source coherence tensor with 
respect to the synthesised beams. 

In the case of spatially incoherent, but not necessarily unpolarized sources, 

i(ni,n 2 ) = I(ni)<y(ni-n 2 ), (12) 
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where B(f2i) contains information about the polarisation of the source, which can be 
projected onto Pauli-spin matrices or spin-weighted spherical harmonics. B(fli) is a 
brightness tensor because it has units Wm~ 2 Sr _1 Hz _1 . In the case of incoherent and 
unpolarised sources, (|T2l) becomes 

E(ni,n a ) = s(ni)I<j(ni-n 2 ), (13) 

where I is the unit dyad, and B(Cti) the brightness of each polarisation in the direction 
of tt v 

(TT3"j) can be expressed in terms of a brightness temperature, TJ,(f2), through 
= > hi/ 3 1 = 

E(n u o 2 ) = — - Ufa - fh). (14) 

c exp[hu/ kTb(ili)\ — 1 

( I14p predicts an infinite correlation dyadic when tli = Q 2 , which occurs because an 
infinitely small coherence area requires an infinitely large number of radiators to be 
packed into every finite region of the sky. Nevertheless, (j!4j) behaves correctly when 
integrated with respect to antenna power patterns, because antenna power patterns, 
even in the multimode case, have finite coherence areas. 

Numerous sources can be modelled in this way, for example, a polarized source 
can be combined with unpolarized emission from the atmosphere having brightness 
temperature T s , to give 

~hv 3 1 



E(ni,n 2 ) 



i+bo; 



<y(ni-n 2 ). (is) 



c 2 exp[hv/kT s ] — 1 

Some care is needed when T s is not uniform because the source is in the far field of the 
array whereas the atmosphere is usually in in the near field; these effects can be taken 
into account, but we shall not do so here. 

Now illuminate a phased array with an unpolarised, incoherent source. Substituting 
f fT3|) in fTTTj) . the elements of the coherence matrix become 

z pp , = [ B(n)t*(£i)-t p ,(n)dn, (ie) 

J A 

the diagonal elements of which correspond to coupling the incoming radiation field to 
the synthesised power patterns, K p (£l) = t*(f2) ■ t p (fl), giving 

Z pp = [ B(tl)K p (Cl)dn. (17) 
J A 

The functions K p (£l) are direction-dependent effective areas, and we could use K p (Cl) = 
A p P p (Cl), where A p is the effective area of the p'th synthesised beam in the most 
receptive direction, and P p ((l) is the dimensionless power pattern. Substituting the 
Planck formula in (fTTj) . and assuming a uniform sky temperature, gives 

hv 3 1 
l A c 2 explhv/kTg 



A p Q p hv 
A 2 exp[hu/kT s } 
hv 

p explhv I kT s ] — 1 



Theoretical Analysis of Astronomical Phased Arrays 



7 



where n p = A p Q p /\ 2 is a coupling factor. For orthogonal primary beams, and a unitary 
beam-forming network, A p Q p /X 2 can be regarded as the number of modes in the beam, 
which is unity, and we arrive at the expected spectral power in the travelling wave. 
Generally, however, for a phased array, A p Q p /\ 2 < 1. Certainly X] p A>^pA 2 — 
because the total power cannot exceed the modal throughput of the primary antennas. 

f|T6|) shows that because the synthesised reception patterns are generally not 
orthogonal the travelling waves at the output ports are correlated, even when the sky 
brightness is uniform. If the source is not uniform, correlations exist even when the 
synthesised beams are orthogonal, which corresponds to the usual case of interferometry. 
Thus, each port of a phased array records the incident flux in the usual manner, but 
because the beams are not necessarily orthogonal, and may even be linearly dependent, 
the travelling waves at the output ports may be correlated, and the correlations will be 
more complicated than in the usual case of interferometry. 

3. Information Throughput and Image Recovery 

Consider information throughput and image recovery. Represent the primary physical 
quantities as abstract vectors. Because the incoming field, E(f2), is square integrable 
over the appropriate region of the sky, A, it can be represented by a vector |E) in 
Hilbert space H. Regions having different shapes and sizes correspond to different 
Hilbert spaces. In the case of realizable phased arrays, where the numbers of ports and 
primary antennas are finite, the measurable quantities y and z are finite-dimensional 
vectors, y G C M and z G C p , but because we wish to draw attention to the relationship 
with frame theory, and to emphasize the role of convergence, we use the more general 
representations |y) G I 2 and |z) G I 2 , where £ 2 is the space of square-summable complex 
sequences. These definitions lead to two operators, one of which, H : H — > £ 2 , maps the 
incoming electric field onto the outputs of the horns, and the other $ : i 2 — > £ 2 maps the 
outputs of the horns onto the outputs of the beam-forming network. These individual 
operators can be combined into a single composite operator T = $H : H — > £ 2 , which 
describes the system as a whole. 

The operation of phased arrays is intimately related to the mathematical theory 
of frames. Suppose that we have some general monochromatic field |E), and that we 
determine the inner products with respect to a set of basis vectors T = {\t p ), p G 
1, • • • , P}: z p = (t p |E). P can extend to infinity, and we do not make any assumptions 
about the orthonormality or linear independence of T. Under what circumstances 
can the original vector |E), which represents a continuous function, be recovered 
unambiguously from a discrete, possibly countable, set of complex coefficients, and 
how can this be achieved? In the context of phased arrays, we are asking under what 
circumstances can the form of an incident electric field be recovered unambiguously from 
the complex travelling-wave outputs. 

Frame theory [3 El [9] proceeds as follows. Evaluate the square moduli of the inner 
products between T and any general vector, |E) G H, and sum the results. If there are 
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two non-zero constants A and B such that < A < oo and < B < oo, and 

A || E f < || T|E) f < B || E || 2 , (19) 
which can also be written 

A||E|| 2 < ^T|(t p |E> H | 2 < S || E f, (20) 

p 

V |E) e H, then the basis set T is called a frame with respect to H. 

Notice the strict use of inequalities in the allowable values of A and B. In the 
case where A « B, the frame is called a tight frame because the inner products for all 
|E) G HI lie within some small range, and the dynamic range needed for inversion is 
small. When the original basis is orthonormal, the frame bounds, A and B, are equal, 
as can be appreciated by inserting |E) = \tp>) in ( l20l) . If the frame is normalized, A is 
a measure of the redundancy in the frame. If a basis set constitutes a frame, then it 
can be shown through (jT9]) alone that the electric field can be recovered unambiguously 
from the inner products, and the more tightly bound the frame, the more tightly bound 
the inverse, and the more stable the image recovery process. The frame condition, f|T9|) . 
is completely general, and applies for any continuous function, even though an infinite 
number of possibly linearly dependent basis functions may be used. Phased arrays have 
a finite number of ports, but can nevertheless span spatially band-limited functions 
having finite support, and can therefore form frames with respect to fields carrying 
finite information. 

Because the synthesised beam patterns of a phased array may be non-orthogonal, 
and even linearly dependent, the recovery of the original field, through an operator we 
shall call T _1 , is best implemented by the introduction of dual vectors, which correspond 
to dual beams. The dual vectors |t p ) of any given frame T, with respect to Hilbert space 
HI, are given by 

|t p ) = S- 1 |t p ), (21) 

where S = T^T is non-singular, and can therefore be inverted. The dual basis set, 
which we shall call T = {\t p ) , p E 1, ■ ■ ■ , P} , has the same degree of completeness as the 
original frame, T, and therefore it too constitutes a frame with respect to HI. Indeed, 
two representations of any general |E) are possible: 

|E) = $> P |E> \t p ) |E) = ^(t p |E) |tp>. (22) 
p p 
f l22|) shows that if one calculates a set of coefficients by taking the inner products with 
a frame, then one inverts the process by reconstructing the field using the dual vectors. 
In the case where the basis vectors are perfectly complete with respect to HI, but not 
necessarily orthogonal, the basis is called a Riesz basis, and the basis set T and dual set 
T are biorthogonal: (t p |ty) = 5 PP ' : Wp,p' E 1, ■ • • , P. 

In the case where the basis vectors do not constitute a frame, but an attempt is 
made to reconstruct the original field vector using the duals, 

|E')=5> p |E>|t p >, (23) 
p 
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the reconstructed vector |E') cannot, for all vectors in H, be the same as the original 
vector |E). It can be shown, however, that the error vector |E) — |E') is orthogonal 
to the basis vectors. Consequently, |E') is the orthogonal projection of |E) onto §, 
the subspace spanned by the under complete set of basis vectors. In other words the 
solution is as close as possible to the original field vector to within the degrees of freedom 
available. 

The relevance to phased arrays is clear; one can measure the complex outputs of a 
phased array, and if the synthesised reception patterns constitute a frame with respect 
to the Hilbert space defined by the shape, extent, and illumination of the input reference 
surface, then the continuous, coherent, incoming field can be reconstructed completely 
from the complex travelling wave amplitudes at the output ports. If the reception 
patterns do not constitute a frame, reconstruction leads to the least square fit that is 
consistent with the degrees of freedom to which the phased array is sensitive. If the 
field of interest has passed through an optical system, which can only ever transmit a 
finite number of modes, then frames can in principle be formed; if the input surface 
corresponds to the sky, frames are not possible, because the dimensionality of a field, 
even over a finite region, is infinite. It is possible, however, to form a frame with respect 
to some given class of object, as will be discussed. It is also possible to synthesize 
more and more beams to tighten a frame, thereby increasing the stability of the image 
recovery process. Central to this model is the notion of dual beams. Every synthesised 
beam has a dual beam, and the concept of dual beams is central to understanding the 
optical physics of phased arrays, and the image recovery process. 

To this point it has been assumed that the incoming field is fully coherent, but 
in the case of astronomical phased arrays the incoming field is usually incoherent, and 
sometimes only the powers are measured at C. According to ffl6l) . the self and cross 
correlations at the output ports of a phased array, for an incoherent, unpolarized source 
are given by 

z pp ,= [ B(n)s; p/ (n)dn, (24) 

J A 

where 

s; p ,(n) = t;(n)-t p ,(n), (25) 

but according to ( 1241) the elements of the correlation matrix are merely the inner 
products of B(&) with respect to S pp >(£l) V p,p' G {1,---,P}. Hence once again, 
frame theory can be used to determine the degree to which B(Cl) can be recovered 
from the correlations between the travelling waves at C. The case were only the 
powers are measured, Z pp V p 6 {1, • • • , P}, is a special case of ff2"4"|) . and corresponds 
to determining the degree to which the synthesised power patterns span the source 
brightness distributions of interest. It seems, therefore, that frames can be defined 
for either fully coherent fields with correlation measurement, for incoherent fields with 
correlation measurement, or for incoherent fields with only power measurement. We 
shall call these possibilities field frames, inter ferometric frames, and intensity frames 
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respectively. Note that the term interferometric does not necessarily imply that an 
aperture synthesis interferometer is being used, but merely that the correlations between 
the travelling waves at the outputs are measured. Each of these frames can be associated 
with dual beams of some kind. In fact, it is straightforward to formulate a more general 
theory, and ask whether a set of synthesised beams forms a frame with respect to the 
full coherence tensor field of the incoming radiation. This more general theory has 
interesting applications, but we shall not described it here. 

Suppose that the goal is to reconstruct the intensity distribution of an astronomical 
source, and one needs to know whether the basis S pp i(Cl) V p,p' £ {1, • • • , P} forms an 
interferometric frame. There is a problem, however, because in assuming that the 
source is spatially incoherent, we have tacitly assumed that the brightness distribution 
is a member of an infinite dimensional space. To answer the question correctly, we 
need to ask whether the phased array is suitable for recovering brightness from the 
vector space of the brightness distributions of interest. One approach is to describe 
the range of possible brightness distributions as a finite, weighted linear combination of 
basis functions, ip n (£l) : n £ {1, • • ■ , N}. These functions could, for example, be radial 
basis functions, or delta functions at certain sample points. The functions need not 
correspond to a single region, but could correspond to different regions of the sky that 
need to be imaged simultaneously. 

The intensity on the sky, according to our chosen class, can be written 

B(6) = J2 a nM&), (26) 

n 

and therefore, according to flM|) . the correlations between the travelling waves at the 
output ports become 

Zppi ^ a n Fppi n , (27) 

n 

where 

F vv,n= [ SL,(n)v«(n) dn. (28) 

J A 

Defining the matrix F whose elements are F qjU , where pp' is now indexed by the 
single integer q, the frame condition (|T9l) becomes 

A<a^Fa<B Va£C 7V . (29) 

(129]) has the Hermitian form, and it is well known that the stationary values of 
the Hermitian form occur when the unit vector a points in the same directions as the 
eigenvectors of F^F, with the stationary values being the corresponding eigenvalues. A 
and B, and hence the tightness of the frame, can be determined by finding the largest and 
smallest eigenvalues of the Hermitian operator F^F, which can be found analytically or 
numerically. Although, the operator F^F maps a finite dimensional space onto itself, the 
mapping passes through a space having infinite dimensions and therefore the integrals 
in ( 128]) should be evaluated analytically if at all possible. The operator F^F simply 
maps the intensity distribution coefficients of the source onto the measured quantities 
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at the output ports of the array and then back again onto the coefficients. If the basis 
functions S pp >(Cl) do not span all possible intensity distributions, information is lost 
when an observation is made, and it is not possible to recover complete information 
about the source. 

In the case where the basis functions correspond to sample points O n , we have 
ip n (£l) = 5(Cl — O n ) and F pp i n = S* ,(fi n ). Although, in practice, a frame cannot 
be formed that reproduces the sky brightness at every one of an infinite number of 
points, a frame can be formed with respect to a finite number of sample points. 
Also, shapeletsfTUl [TTl [12], which are essentially Gaussian Hermite and Gaussian 
Laguerre polynomials, are now being used to parameterize the brightness distributions 
of astronomical sources, such as galaxies. It would therefore be possible to check whether 
an imaging phased array forms a frame with respect to a set of shapelets, and to recover 
the shapelet coefficients directly from observations. 

4. Natural Modes of Phased Arrays 

Superficially speaking, the natural modes of a phased array are those orthogonal field 
distributions that can pass through the horns and beam-forming network with high 
efficiency. When the synthesised beams are linearly dependent, or even if they are simply 
non-orthogonal, the forms of the natural modes are not directly known. The natural 
modes are, however, central to understanding noise and correlations, to understanding 
quantum statistical behaviour, which is important at sub-millimetre wavelengths, and 
to understanding the operation of interferometric phased arrays. 

Suppose that a telescope is equipped with an imaging phased array. As described 
previously, the process of projecting a coherent field onto the complex travelling wave 
amplitudes at the output ports is described by the mapping T : |x) i — > |z) as 
T : HI — > C 2 . The phased array acts as a linear operator between two Hilbert spaces: the 
space of square integrable functions over the input reference surface, and the space of 
square summable complex sequences. Because the operator maps between two different 
Hilbert spaces, it does not make sense to search for the eigenf unctions of such an operator 
in an attempt to find the natural modes. 

For any real system, it is certainly known that every incoming field carries a finite 
amount of power, and the associated complex travelling waves at the output ports carry 
a finite amount of power: T is therefore bounded. More importantly, a real system 
can only transmit a limited amount of information, because a finite number of primary 
antennas is used: T is therefore Hilbert-Schmidt. It follows that the integral operator 
that maps the incoming field distribution onto the output ports can be written in the 
form [13] 

Zp = I y;<7iU?v?(n)-E(n)dn. (30) 

It is a feature of the Hilbert-Schmidt decomposition that {Vj(O) : % £ 1, • • • , P} 
and {Uf : p & P; i E P} are orthogonal sets and therefore, according to 
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( j5U|) . the operation of a phased array can be regarded as first mapping the incoming 
field onto the beams, Vj(fi), scaling by the singular values, <jj, and reconstructing the 
complex travelling wave amplitudes at the output ports through the basis vectors Uf . 
Those reception patterns, Vj(f2), associated with non-zero singular values span the field 
distributions at the input to which the phased array is sensitive, and those output vectors 
Uf , associated with non-zero singular values span the vectors at the output to which the 
phased array can couple. These are the natural modes of a phased array, and we shall 
call them the eigenfields, by analogy with bolometric interferometers [H] . In the case of 
fields that have been discretised for numerical calculation, Vj(fi), Uf , and <7j correspond 
to the singular vectors and singular values of the singular value decomposition of the 
phased array's transmission matrix T. 

It can be shown, [14], although we shall not do so here, that the eigenfields on the 
sky are the eigenf unctions of the Hermitian operator that maps the sky field onto the 
travelling waves at the output ports, and then back again onto the sky in a time reversed 
manner. The input eigenfields are those field distributions that remain unchanged in 
form after this complete round trip. Likewise, the output eigenfields are the eigenvectors 
of the Hermitian operator that maps the travelling waves at the output ports onto the 
sky in a time reversed manner, and then forward again to the output ports. The output 
eigenfields are those discrete vectors that can make this round trip unchanged in form. 

The eigenfields have many unique properties. First, they represent the primary 
paths by which information can pass from the sky to the output ports. The input 
eigenfields on the sky are mutually orthogonal, even though the synthesised beams may 
be non-orthogonal or linearly dependent. The output eigenfields, which are actually 
discrete vectors in this case, are also mutually orthogonal. Secondly, the number of 
singular values, cr.j, greater than some threshold is the modal throughput of the system. 
Thirdly, if two phased arrays are placed side-by-side, with the intention of creating an 
aperture synthesis interferometer, the input eigenfields having non-zero singular values 
associated with the different telescopes are mutually orthogonal [H], which makes them 
ideal for analysing the behaviour of interferometers. 

Now consider how the concepts of frames and eigenfields can be used to construct 
powerful models of imaging phased arrays. The correlations at the output ports can be 
found through Z pp i = (z p z*,); using ( l30i) . 

which is equivalent to (TlTl) . but with the array described in terms of its natural modes. 
f }3Tj) describes the projection of the coherence tensor of the source onto the input 
eigenfields, the eigenfield coefficients are then multiplied by the singular values, and 
the correlations at the output ports assembled through the output eigenfields. It is 
interesting to observe that if a phased array is illuminated by an incoherent, unpolarized, 
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spatially uniform source, then using (THE]) with a uniform temperature T s , and (13~T|) . gives 

Z pp' = Fu h nrr* ; 5" ^ U ? U f *> ( 32 ) 

pp exp[/iz//£;T] -1^ ! ! 

where we have followed a procedure similar to that described by f|T8|) . fl32l) shows that 
the correlations at the output are merely those associated with the incoherent excitation 
of the output eigenfields. Indeed the total power, W, becomes 

where we have used the orthogonality of the output eigenfields. Comparing with (fTHI) . 

J2a p q p /\ 2 = J2°1 < M - ( 34 ) 

v v 

The total throughput is determined by the number of singular values significantly greater 
than some threshold, usually determined by the noise. fl34|) can be regarded as a 
statement about the modal throughput of a complete array; an important performance 
metric of phased arrays can therefore be determined from the synthesised beams alone. 

5. Noise 

Consider the case where the primary antennas and beam forming network are are made 
up of passive components, such as a planar array and microstrip coupling network. 
What noise sources appear at the output as a consequence of the losses? Because the 
signal is uncorrelated with the internally generated noise, the correlation matrix at the 
output can be written 7J = Z + Z N , where Z' is the combined output, Z the output due 
to the signal, and Z N the output due to the thermal noise from the losses. 

If the array is illuminated by a uniform, unpolarized thermal source, and the losses 
in the array are at the same temperature as the source, T s , the travelling waves at 
the output will be uncorrelated, and appear to originate from a thermal source having 
temperature T s . Under these circumstances, using f fT6|) 

Z ' = \U //T1 = 2 \U //Tl 7R+ Z JV ; (35) 

exp|/iz///ci s J — 1 cr exp|/iz///ci s J — 1 

where R has elements 

R pp'= ! t;(^) • V(0)dO. (36) 

J A 



Rearranging, 

„ hv 



explnis/kls] — 1 

If the synthesised beam patterns are known, and all of the overlap integrals calculated, 
giving R, the thermal noise and the correlations at the output, Zjy, can be found. 

Now consider the case where the phased array is not passive, for example when 
HEMT amplifiers are used, but the noise temperatures of the primary receiving antennas 
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are known, and equal. By definition, the noise temperature of a receiver is the 
temperature that a matched source would need to have in order to generate the same 
output as a noiseless, but otherwise identical system. Let us call the associated noise 
correlation matrix at the output Z' N , where Z' = Z + Z' N , then to find Z' N we simply 
need to calculate the coefficients of a uniform sky having the same physical temperature 
as the noise temperatures of the receivers. Using (fl4"]l and ffTTj) we get 

ZAr = \u nr i T T2 R ' ( 38 ) 

exp[np/ kin] — 1 A z 

where T/v is the noise temperature of the individual receivers. In both cases, if the 
beams are orthogonal, the noise sources are uncorrelated. In the case where the primary 
receiver temperatures are different, a similar scheme can easily be established. 



6. Fluctuations and Sensitivity 

The last step in the analysis is to determine the fluctuations in power and the correlations 
between the fluctuations in power at the output ports of a phased array once Z, or Z', 
is known; after all, it is the fluctuations that ultimately determine the sensitivity of an 
instrument. Following recent work on multimode detectors [T4"l [15] . it is straightforward 
to determine the fluctuations in any power-related measurement that is made at the 
output ports. 

If Z is the coherence matrix of the complex travelling wave amplitudes at the output 
ports, and W G C PxP is a matrix that characterises the nature of the measurement 
being made, the expectation value of the measurement, (P), is given by 

p = (P) = Av TrWZ, (39) 

where Av is the RF bandwidth. ([39]) can be appreciated by remembering that the trace 
of the product of two matrices constitutes an inner product in the abstract vector space 
of matrices. Thus, (139]) describes the way in which the state of coherence of the source 
projects onto the state of coherence to which the measurement system is sensitive. The 
structure of (139]) has be explored in detail, and leads to the coupled-mode theory of 
power detection[15l [161 EH]- 

More significantly, it has also been shown [T6] that if two measurements are made, 
one represented by W a and one by W b , then the covariance of the measurements, 
Cov [P a , P b ] = ((P a - P a )(P b - P b )), is given by 

Cov [P a , P b ] = — [TrW a ZW b Z + <5 ab /iz/ TrW a Z] , (40) 

where Uq is the central RF frequency, and r the time for which the output is integrated. 
([40]) is valid for thermal sources, and when the integration time is much longer than 
either the coherence time of the source or the intrinsic response time of the detector. 
It is valid, therefore, for astronomical phased arrays. The first term in (|4"0"1) can be 
identified with the noise associated with classical waves, whereas the second term with 
the noise associated with photon counting. Thus, (jlDj) contains the transition from fully 
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bunched to Poisson statistics as the wavelength of operation moves from the microwave 
range through into the far infrared. (HOl) contains a considerable amount of detail, 
including partition noise, and the randomization of photon arrival times due to losses 
and imperfect quantum efficiencies. 

We can imagine three principle measurements at the output ports. First consider 
a measurement having the matrix elements 

W§ = V^iAa, (41) 

where i,j G {1, ■ • • , P}, which is diagonal with just one non-zero element. Substituting 
HD in (|39H gives 



(42) 



j 1 



ij 

= AvifZ aa . 

ETJ) therefore corresponds to measuring the expectation value of the power at port a, 
using a detector having quantum efficiency vj 1 . Likewise, a power measurement at port 
b is characterised by 

Wg = v%j$ib- (43) 

In the case of a phased array where only powers are measured, which would 
correspond to an ordinary imaging array with the appropriate synthesised beams, (j4ip 
and ( 1431) can be substituted into ( 140]) to give 



Au 



Cov [P a , P b ] = — [v a V b \Z ab \ 2 + S ab hu oV a Z aa ] . 



(44) 



When we are interested in the true fluctuations in the power at port a, as distinct 
from the fluctuations in the absorbed power, then a = b and r] a = rf = 1, and the root 
mean square fluctuation, AP a , is given by 

1/2 



AP a = {Cov[P a ,P a ]} irz = [ — 



hv Z aa ] 



1/2 



(45) 



We can also form the quantity, which is the noise in the measurement normalised 
by the signal, 



-1/2 



or 



AP a 

AP a _ 1 
JP^) ~ (Aut) 1 / 2 



+ 



hu 



Au AvZ n 



1 + 



Au 



1/2 



■,1/2 



n 



(46) 



(47) 



where n = AvZ aa jhv§ is the average photon rate. In the case where the photon rate is 
much greater than the bandwidth, or equivalently many photons arrive in the radiation's 
coherence time, the sensitivity scales according to the radiometer equation; in the case 
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where very few photons arrive in the coherence time, the sensitivity scales according to 
photon counting. 

Likewise the root mean square correlation, AP ab , between the fluctuations in two 
different power measurements, a ^ b, is given by (144"I) as 

1/2 



AP ab ={Cov[P a ,P b ]} 1/2 = f^j \Z ab \ 



(48) 



and we also have 

Apab _ 1 \ Z *b\ _ 1 iT-ofci (49) 



(pa)i/2( P by/2 (Ai/r)V2 (Z aa Z bb y/ 2 {Aut) 1 / 21 

where |r afe | is the correlation coefficient. ( H9l) is the Hanbury Brown-Twiss effect for 
phased arrays. It does not show Poisson noise because the Poisson term corresponds 
to photon arrival times that are uncorrelated. In the case where the synthesised beams 
are orthogonal, and the source uniform, Z ab = 0, and the fluctuations in the powers 
measured at different ports are uncorrelated. In the case of non-orthogonal beams, or 
a non-uniform source on the sky, the fluctuations are correlated, and the correlations 
must be taken into account when calculating the noise in the reconstructed image. 

Now consider the case where we wish to measure the correlations between the 
travelling waves at the output ports, say a and b, for the purpose of recovering an 
image, as in done in interferometry. In this context, there are two measurement-system 
matrices of interest: 

1 

2 



R$ = V ab ^ ia S jb + S ib S ja ) (50) 



where a ^ b, and the i in the prefactor corresponds to the unit imaginary, rather than 
the index, with no confusion. The two matrices comprising the elements listed in (|50|) 
are Hermitian, and characterise realizable measurements. 
Substituting fl50|) in fl39|). 

(R ab ) = Au7] ab ReZ ab (51) 
(I ab ) = Au V ab lmZ ab , 

and the two measurements correspond to finding the real and imaginary parts of the 
correlations between ports a and b, which is equivalent to finding the in and out of 
phase components of the fringe formed when the signals at a and b are combined. More 
specifically, the matrices correspond to using an analogue correlator to measure the cos 
and sin fringes. 

We can now substitute ( 1501) into (HO]) to give 

1 



Cov \R ab } R ab ] = — 

T 



( V ab y-(Z aa Z bb + ReZ 2 ab ) + hu oV ab ReZ ab 



giving a noise to signal ratio of 

AR ab _ 1 [\(Z aa Z bb + ReZ 2 ab ) + hu ReZ ab ] 1/2 



(R ab ) {Aut) 1 / 2 ReZ, 



ab 



(52) 



(53) 
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for r] ab = 1, which corresponds to the actual power. For a completely incoherent source, 
and orthogonal beams, Z ab = 0, the noise becomes 



AR 



■ab 



1 1 1/2 

(Al/T^Tf (( P aa)(Pbb)) 



(54) 



and the noise is classical, with the fluctuations being proportional to the geometric 
means of the two power levels, which is a well known feature of noise in interferometers. 
In the case where the two signals at a and b are fully coherent, Z aa Z bb 

get 



AR 



ab 



or 



(R ab ) 

AR ab 



-1/2 



+ 



hv 



Au Au\Z nb \ cos 9, 



Tab 



1/2 



\Zab\ , we 
(55) 



Av 
1 + — 

n r 



1/2 



(R ab ) (Az/r)V2 ^ 1 n c J ^ 
where n c is the mean number of photons in the in-phase correlated power, and 9 ab is the 
phase of Z ab . Thus, for weak fringes, and even in the nulled region of a strong fringe, 
classical noise is swapped for Poisson noise. In general, the nature of the noise will 
change throughout the fringe. 
Similarly, we have 



giving 



Cov [I a \ I ab ] 



AI ab 



Av 



{r] ab f l -{Z aa Z bb - ReZ 2 ab ) + hu oV ab lmZ ab 



[\(Z aa Z bb - ReZ 2 ab ) + hu lmZ ab ] 



1/2 



(57) 



(58) 



{I ab } (Aut) 1 / 2 lmZ ab 
for r] ab = 1. 

For incoherent travelling waves, the noise is the same as for the in-phase 
measurement, (I54p . For fully coherent signals, 



AI 



ab 



or 



(I ab ) 
AI ab 



-1/2 



hv 



1 

Av Av\Z nb \ sin 9, 



'ab 



1/2 



(I ab ) (Az/r)V2 



1 + 



Av 



ll/2 



(59) 



(60) 



where n s is the mean number of photons in the quadrature component of the correlated 
power. Overall, the same behaviour is seen as for the in-phase component. 

Finally, we can correlate the fluctuations in the in-phase and quadrature 
measurements, giving 



Cov [R ab , I ab ] 



Av 



T 



-ImZ, 



ab 



and 



ARI 



ab 



( R aby/2(Jaby/2 ( Al/ r )l/2 ^ 



ReZ ab ImZ, 



ab 



1/2 



(61) 



(62) 
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giving 

1/2 (Jab) 1/2 (Aut) 1 / 2 ' 1 J 

It is also possible to calculate the correlations between the fluctuations in the real and 
imaginary parts of fringe measurements on two different pairs of ports. 

7. Conclusion 

We have analyzed the behaviour of phased arrays from a functional perspective, and 
shown that their operation is intimately related to the mathematical theory of frames. 
In cases where the beams are non-orthogonal, or even linearly dependent, image 
reconstruction can be carried out using dual beams. The theory of frames allows one to 
assess, in a straightforward manner, whether the powers or correlations at the output 
ports of a phased array contain sufficient information to allow some class of field or 
intensity distribution to be reconstructed precisely. We have also identified the natural 
modes of phased arrays, which are important for understanding information throughput 
and aperture synthesis interferometry. 

In order to calculate the behaviour of an imaging phased array it is only necessary 
to know the synthesised reception patterns, which may be non-orthogonal. It is 
not necessary to know anything about the internal construction of the array itself. 
As a consequence, data can be taken from experimental measurements or from 
electromagnetic simulations. The ability to assess behaviour simply from the synthesised 
beams separates the process of choosing the best beams for a given application from 
the process of realizing the beams in practice. It also suggests important techniques for 
analysing experimental data. 

Our model allows the straightforward calculation of quantities such as the 
correlations in the fluctuations at the output ports of an phased array. Indeed, sources 
can be constructed through expressions of the kind (|T5|) . the correlations at the output 
ports calculated through (fill , or in the modal case (13TT) . system noise can be included 
through fl37|) and (1381) . and the expectation values, fluctuations, and correlations between 
fluctuations in measurements at the output determined through ( l39i) and ( HOI) . The 
whole procedure only requires simple matrix algebra. The scheme is conceptually 
and numerically powerful|18j. and in an upcoming paper we shall present simulations 
showing dual beams, natural modes, and image reconstructions. 
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